Conversation

@joerunde
Collaborator

This PR:

  • Updates the k8s example to add the required scheduler name and switch to a startup probe with a proper progress deadline
  • Adds a page with a kserve example for RHOAI usage
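
For context, the two Deployment changes described above might look roughly like the sketch below. This is illustrative only: the scheduler name, image, and probe values are placeholders, not values taken from the PR diff.

```console
# Sketch only: schedulerName and startupProbe as they might appear in the
# example Deployment. <required-scheduler-name> and the probe numbers are
# placeholders, not values from the PR.
oc apply -f - <<EOF
apiVersion: apps/v1
kind: Deployment
metadata:
  name: vllm-spyre-example
spec:
  selector:
    matchLabels:
      app: vllm-spyre-example
  template:
    metadata:
      labels:
        app: vllm-spyre-example
    spec:
      schedulerName: <required-scheduler-name>
      containers:
      - name: vllm
        image: <vllm-spyre-image>
        startupProbe:
          httpGet:
            path: /health
            port: 8000
          # "progress deadline": up to 180 x 10s = 30 min to start up
          periodSeconds: 10
          failureThreshold: 180
EOF
```

A startup probe holds off liveness and readiness checks until it first succeeds, which suits long model-loading times better than a liveness probe with a very large `initialDelaySeconds`.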

@joerunde joerunde requested a review from rafvasq as a code owner June 10, 2025 20:55
@github-actions

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure that your code passes all of the linting checks, otherwise your PR won't be mergeable. To fix this, first install the linting requirements, then run `format.sh` and commit the changes. This can be done with uv directly:

```console
uv sync --frozen --group lint --active --inexact
```

Or with pip:

```console
uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh
```

Now you are good to go 🚀

@joerunde
Collaborator Author

```console
bot:test
MARKERS="embedding"
```

Collaborator

@rafvasq rafvasq left a comment


I'll read through the RHOAI doc tomorrow, quick note that it's not in .nav.yml though!

@joerunde
Collaborator Author

Thanks!
Also cc @tjohnson31415: what I don't know how to do with kserve is correctly set up the ports so that you can just curl the predictor service. On my cluster I can seemingly hit the service if I set the port manually, but I don't think I should have to do that.

This connects:

```console
curl http://granite-3-1-8b-instruct-predictor.a1-vllm-spyre:8000/v1/completions
```

But this refuses a connection:

```console
curl http://granite-3-1-8b-instruct-predictor.a1-vllm-spyre/v1/completions
```

(I'm probably just missing something simple)
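
One possible cause (an assumption on my part, not confirmed in this thread): if the runtime container doesn't declare its port, the generated predictor Service may not map a default port onto vLLM's 8000. Declaring it explicitly in the ServingRuntime might look like this sketch, where the runtime name and image are placeholders:

```console
# Sketch: declare the container port in the ServingRuntime so the generated
# predictor Service can map its default port to vLLM's 8000.
# <vllm-spyre-image> is a placeholder, not a real image reference.
oc apply -f - <<EOF
apiVersion: serving.kserve.io/v1alpha1
kind: ServingRuntime
metadata:
  name: vllm-spyre
spec:
  supportedModelFormats:
  - name: vLLM
  containers:
  - name: kserve-container
    image: <vllm-spyre-image>
    ports:
    - containerPort: 8000
      protocol: TCP
EOF
```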

Comment on lines 98 to 105
3. Deploy and Test

Apply the manifests using `oc apply -f <filename>`:

```console
oc apply -f servingruntime.yaml
oc apply -f inferenceservice.yaml
```
Collaborator


I think you could just include these apply lines in each step above, right after defining the manifests, and then use this step for a "Perform an inference request" example.

Collaborator Author


👍
I went ahead with the heredoc pattern in each section above (`oc apply -f - <<EOF`) and just linked out to the relevant kserve docs here for how to set up inference with a vLLM deployment, so we don't have to repeat that info.
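
For readers skimming the thread, the heredoc pattern is just piping the manifest to `oc` on stdin instead of keeping a separate file. A minimal sketch, with an assumed runtime name and a placeholder storage URI:

```console
# Apply a manifest from stdin: "-f -" tells oc to read the resource
# from the heredoc rather than from a file on disk.
oc apply -f - <<EOF
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: granite-3-1-8b-instruct
spec:
  predictor:
    model:
      modelFormat:
        name: vLLM
      runtime: vllm-spyre                            # assumed runtime name
      storageUri: oci://<registry>/<model-artifact>  # placeholder
EOF
```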

@prashantgupta24
Collaborator

I think there was a way to fix the sign-off on commit suggestions directly from the PR page, but I can't remember 🤔

Expected "Joe Runde [email protected]", but got "Joe Runde [email protected]".

@joerunde
Collaborator Author

Yeah, that confuses me, since my regular commits (that pass DCO) are all signed off with [email protected] :(

Co-authored-by: Rafael Vasquez <[email protected]>
Signed-off-by: Joe Runde <[email protected]>
@joerunde
Collaborator Author

@prashantgupta24 for some reason GitHub defaulted to my IBM email for the commit author, but my personal email for the signoff. 🤷

```console
git commit --amend --reset-author
git push -f
```

fixes it

Collaborator

@rafvasq rafvasq left a comment


My last nit, thanks Joe!

Co-authored-by: Rafael Vasquez <[email protected]>
Signed-off-by: Joe Runde <[email protected]>
@rafvasq rafvasq merged commit d56423f into main Jun 17, 2025
19 checks passed
@rafvasq rafvasq deleted the rhoai-examples branch June 17, 2025 12:50